FIELD OF THE INVENTION

The present invention includes a mammalian cellular protein that associates with
integrase (integrase interacting protein), and antibody or antisense inhibition of said
integrase interacting protein. The novel integrase interacting protein is a target for
prevention or inhibition of HIV replication.

SUMMARY OF THE INVENTION 

In a study of HIV-1 integrase (IN) complexes derived from nuclei of human cells
stably expressing the viral protein from a synthetic gene, it was demonstrated that in the
nuclear extracts IN exists as part of a large distinct complex with an apparent Stokes radius
of 61 Å, which dissociates upon dilution, yielding a core molecule of 41 Å. The IN
complexes were isolated from cells expressing FLAG-tagged IN. By the present invention it
was demonstrated that the 41 Å core is a tetramer of IN, whereas the 61 Å molecules are
composed of IN tetramers associated with a cellular protein with an apparent molecular
weight of 76 kDa. This integrase interacting protein (Inip76) was found to be identical to
LEDGF/DFS70/p75, a protein implicated in regulation of gene expression and cellular
stress response. HIV-1 IN and Inip76 co-localized in the nuclei of human cells stably
expressing IN. Furthermore, it has been demonstrated that recombinant Inip76 strongly
promoted strand-transfer activity of HIV-1 IN. Our findings reveal that the minimal IN
molecule in human cells is a tetramer and clearly demonstrate that Inip76 plays a role in
retroviral integration.
BACKGROUND OF THE INVENTION

Establishment of the provirus, a DNA copy of the viral genome integrated into the
host cell chromosome, is an obligatory step in retroviral replication. Stable integration into
the human genome is one of the primary reasons for the persistence of the human
immunodeficiency virus (HIV) infection, which leads to the acquired immunodeficiency
syndrome (AIDS). Therefore, HIV IN, the enzyme orchestrating the insertion of the DNA
replica of the viral genome into the cellular chromosomal DNA, is a potential target for
antiretroviral therapy. The recent discovery of new lead compounds able to protect
cells against HIV infection by inhibiting integration raises hope that integrase inhibitors
will be added to future combination cocktails (for reviews see Debyser et al., 2002;
Miller and Hazuda, 2001; Pani and Marongiu, 2000; Pommier et al., 2000).

Mechanistically and structurally, retroviral integrases (reviewed in Asante-Appiah
and Skalka, 1997; Brown et al., 1987; Craigie, 2001; Hindmarsh and Leis, 1999) are
similar to the well-studied prokaryotic Mu and Tn5 transposases and belong to a family
of DNA strand transferases that catalyze DNA strand cleavage/ligation via direct
trans-esterification. In the course of viral integration, HIV IN performs two enzymatic
reactions. The first is the removal of the 3'-GT dinucleotides from the long terminal repeats
(LTRs) (i.e. the 3'-processing reaction). The second reaction is the insertion of the
recessed 3' viral DNA ends into the opposite strands of the target DNA, wherein the 3'
hydroxyls of the processed LTR ends attack two phosphodiester bonds in the target DNA
molecule (the strand-transfer reaction). Insertion of the two viral LTRs takes place in a
coordinated fashion across the major groove of the target DNA. As a result, the integrated
provirus is flanked by two 5-nucleotide gaps as well as two mismatched 5'-AC
dinucleotides, which are probably repaired by cellular enzymes (Yoder and Bushman,
2000). HIV IN, like other retroviral integrases, is composed of three structural domains.
The amino terminal domain contains the HHCC zinc finger motif, coordinates Zn and
appears to be important for formation of the active IN multimers. The central, catalytic or
"core" domain (residues 50-212) contains the active site (i.e. the DDE motif composed of
D64, D116 and E152) and is structurally very similar to the catalytic domains of Mu
phage and Tn5 transposases. In the context of the full-length enzyme, the core domain is
thought to make sequence-specific contacts with the DNA. The carboxy terminal
fragment possesses unspecific DNA-binding activity and is critical for multimerization of
the enzyme. All single domain fragments of IN are dimeric when produced separately;
the full-length HIV IN as well as other retroviral integrases form multimers under the
conditions of the in vitro enzymatic assays (Ellison et al., 1995; Engelman et al., 1993).
Recombinant HIV and avian sarcoma virus (ASV) integrases have been shown to behave
as mixtures of monomers, dimers and tetramers (Coleman et al., 1999; Deprez et al.,
2000; Jenkins et al., 1996). The presence of octamers and larger complexes has been
suggested by some studies (Lee et al., 1997; Leh et al., 2000). Virion-associated HIV IN
was shown to be in an oligomeric form, whereby dimers and higher order multimers
appeared to be stabilized by disulfides, although the complexes were not studied under
native conditions (Petit et al., 1999). Earlier, IN derived from avian myeloblastosis virus
(AMV) particles was demonstrated to be multimeric, behaving at least as a dimer
(Grandgenett et al., 1978).

The 3'-processing and strand-transfer reactions can be reproduced in vitro using
recombinant enzyme preparations and DNA substrates that mimic LTR ends. In vivo,
retroviral integration is preceded by the assembly of a stable and compact preintegration
complex (PIC) that contains a 9.5 kb linear DNA copy of the viral genome associated
with viral and cellular proteins (Brown et al., 1987; Bukrinsky et al., 1993; Ellison et al.,
1990; Farnet and Haseltine, 1991; Miller et al., 1997). Several cellular proteins have been
suggested to play auxiliary roles during retroviral integration. Thus, the
barrier-to-autointegration factor (BAF) has been reported to protect Moloney murine leukemia virus
(MoMLV) PICs against suicidal self-integration (Lee and Craigie, 1998). Another
cellular protein, HMG-I(Y), was found in HIV PICs and appeared to be essential for their
integration activity in vitro (Farnet and Bushman, 1997; Miller et al., 1997). Conversely,
BAF could substitute for HMG-I(Y) at least in vitro, partially restoring integration
activity of salt-denatured HIV-1 PICs (Chen and Engelman, 1998). Yet, BAF remains to
be shown to co-fractionate with retroviral PICs. Both BAF and HMG-I(Y) are small
DNA-binding proteins able to bridge and deform DNA molecules (Reeves and
Beckerbauer, 2001; Zheng et al., 2000). They are thought to play structural roles within
retroviral PICs, possibly juxtaposing both LTRs. Similarly, Mu phage transposase as well
as the enzymatically unrelated λ phage integrase require the DNA-bending host proteins IHF
and/or HU to form committed synaptic complexes (Friedman, 1992; Mizuuchi, 1992).
IHF and HU appear to substitute for each other in promoting formation of the active λ
integrase-DNA complexes in vitro (Segall et al., 1994). Another potential co-factor for
HIV integration, the integrase interactor 1 (Ini1 or hSNF5/BAF47), was originally
discovered in a yeast two-hybrid screen for human proteins interacting with HIV-1 IN
(Kalpana et al., 1994). Ini1 is a subunit of the 2 MDa SWI/SNF chromatin-remodeling
complex (Wang et al., 1996). It has been proposed that Ini1 plays a role during retroviral
replication by directing the PICs to open chromatin regions (Kalpana et al., 1994) or by
modulating expression of the integrated provirus (Turelli et al., 2001). Recent studies
demonstrated that GFP-tagged Ini1 was exported from the nuclei of infected cells and
co-localized with incoming sub-viral particles (Turelli et al., 2001). Ini1 has also been
reported to enhance the release of infectious HIV particles; detectable amounts of Ini1
have been shown encapsulated within virions (Yung et al., 2001).

Using a synthetic gene, we have been able to achieve efficient expression of
HIV-1 IN in human cells (Cherepanov et al., 2000). By the present invention we have now
characterized HIV-1 IN protein complexes present in nuclear extracts from cells stably
expressing this viral protein. For the first time we report an HIV integrase-interacting protein
that forms a distinct complex with IN in human cells and provide an insight into the
oligomeric state of intracellular HIV IN, suggesting that the minimal cellular IN complex
is a tetramer. It is an object of the present invention to prevent or inhibit the integration of the
HIV genome into the host cell chromosome by inhibiting the interaction between the
integrase interacting protein and integrase.
ILLUSTRATIVE EMBODIMENT OF THE INVENTION
Examples
General methods

Example 1: Recombinant DNA

The HIV-1 integrase expression constructs were based on the episomal pCEP4 vector
(Invitrogen, Groningen, The Netherlands). The plasmid pCEP-IN*ala is identical to the
published pCEP-IN* plasmid (Cherepanov et al., 2000) with the difference that the Gly
codon in the second position of the synthetic open reading frame (ORF) was mutated to
Ala. As a result, the construct expressed native HIV-1 IN with an addition of a Met-Ala
dipeptide at the N-terminus. To create the FLAG epitope-tagged IN expression construct
pCEP-IN*alaFLAG, the IN synthetic ORF from pCEP-IN*ala was amplified in two
consecutive steps with the sense primer 5'-GGCTAGATATCACTAGCAACCTCAAACAG
plus two anti-sense primers, 5'-GTCGTCCTTGTAATCGCCGTCCTCATCTTGACGAGAG
and 5'-GGCGCTCGAGTTACTTGTCATCGTCGTCCTTGTAATCGC; the resulting PCR
fragment was digested with XhoI and cloned between the EcoRI (blunt) and XhoI sites
of the pCEP4 vector. This plasmid expressed HIV-1 IN carrying the C-terminal FLAG
epitope (DYKDDDDK). The plasmid pRP1012, for bacterial expression of His6-tagged
HIV-1 IN, was a gift of Dr. R. Plasterk (The Netherlands). To obtain pCP6H75, the plasmid
used for bacterial expression of His6-tagged Inip76/LEDGF, the PCR fragment amplified
from a HeLa cDNA with the primers 5'-GGCCGGATCCGACTCGCGATTTCAAACCTGGAGAC
and 5'-CCGCGAATTCTAGTTATCTAGTGTAGAATCCTTC was digested with BamHI and EcoRI
and cloned between the BamHI and EcoRI sites of pRSETB (Invitrogen). To prepare the
mini-HIV IN substrate, the plasmid pU3U5 (Cherepanov et al., 1999) was digested with ScaI.
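
As an illustrative check of the primer design described in Example 1 (not part of the original methods), the following Python sketch reverse-complements the second anti-sense primer and translates the sense strand. The helper names and the reading-frame offset of 2 are assumptions chosen for this example; the codon table is the standard genetic code.

```python
# Sanity check: read on the sense strand, the outer anti-sense primer should
# encode the FLAG epitope (DYKDDDDK), a stop codon, and carry the XhoI site.

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code, built from the conventional TCAG ordering.
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(seq):
    """Translate complete codons only; '*' marks a stop codon."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# Second anti-sense primer as given in Example 1.
ANTISENSE_OUTER = "GGCGCTCGAGTTACTTGTCATCGTCGTCCTTGTAATCGC"
sense = reverse_complement(ANTISENSE_OUTER)

FRAME_OFFSET = 2  # assumption: frame that aligns with the epitope codons
peptide = translate(sense[FRAME_OFFSET:])

print(sense)
print(peptide)
```

The translated peptide begins with the FLAG epitope followed by a stop codon, and the XhoI recognition sequence (CTCGAG) lies just downstream, which is consistent with the cloning strategy described.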

Example 2: Cell Culture

The human embryonic kidney cells expressing SV40 large T antigen, 293T, were obtained
from Dr. O. Danos (Evry, France). Cells were grown in Dulbecco's modified Eagle's
medium (DMEM) (Invitrogen) supplemented with 10% fetal calf serum (FCS), 2 mM
glutamine and 20 µg/ml gentamicin at 37°C in a 5% CO2 humidified atmosphere. To
establish stable cell lines, 293T cells were transfected by electroporation with integrase
expression constructs and selected with 200 µg/ml of hygromycin B (Invitrogen). The
selected cell lines, 293T-IN*ala and 293T-IN*alaFLAG, expressed IN at levels similar
to the previously reported 293T-IN* cells (Cherepanov et al., 2000) as determined by
western blotting and immunofluorescence. IN expression was stable for over 40 passages.
For radioactive immunoprecipitation experiments cells were labeled in
methionine/cysteine-free DMEM (Invitrogen) supplemented with 10% dialyzed FCS plus
0.1 mCi/ml of TRAN35S-LABEL (ICN Biomedicals, Asse-Relegem, Belgium) for
24 hours.

Example 3: Preparation of Nuclear Extracts

293T-IN*ala or 293T-IN*alaFLAG cells grown to a confluence of 80-90% were
dissociated from plastic dishes using trypsin, washed with PBS and re-suspended in the
modified cytoskeleton buffer (10 mM Pipes [pH 6.8], 10% (w/v) sucrose, 1 mM DTT,
1 mM MgCl2 plus the EDTA-free protease inhibitor cocktail (Roche, Brussels, Belgium))
(Fujita et al., 1997) containing 100 mM NaCl (referred to as 100CSK buffer). Cells were
lysed for 10 min on ice with 0.5% NP40; nuclei were pelleted and washed with 100CSK.
To extract IN, nuclei were re-suspended in the 400CSK buffer (same as 100CSK, but
containing 400 mM NaCl) and left on ice for 5 min; the chromatin was removed by
centrifugation at 7,500 rpm for 2 min. The total protein content of the nuclear extracts
was measured using the BCA protein assay (Pierce, Rockford, IL), with bovine serum
albumin (BSA) as the standard.

Example 4: Chemical Cross-linking

The nuclear extracts were diluted using 400CSK buffer to adjust the total protein
concentration to 100, 20 or 4 µg/ml. DTSSP (Pierce, Erembodegem-Aalst, Belgium) was
dissolved in water immediately prior to the experiment and used at 2, 0.5 or 0.1 mM.
Cross-linking reactions were allowed to proceed for 15 min at room temperature and were
terminated by addition of 1/3 volume of the 4× SDS sample buffer (200 mM Tris-HCl [pH
6.8], 4% SDS and 40% (v/v) glycerol) and incubation at room temperature for 20 min.
Example 5: Gel Filtration Chromatography

Nuclear extracts and immunopurified INf were fractionated on a Superdex 200 HR 10/30
gel filtration column (Amersham-Pharmacia). The 400CSK buffer was used in all
chromatography experiments. The column was operated at 0.6 ml/min, 4°C, and
calibrated using low and high MW gel filtration standards from Amersham-Pharmacia
(blue dextran; thyroglobulin, Rs/MW 85 Å/669 kDa; ferritin, 61 Å/440 kDa; catalase,
52.2 Å/232 kDa; aldolase, 48.1 Å/158 kDa; BSA, 35.5 Å/67 kDa; chymotrypsinogen A,
20.9 Å/25 kDa). The sample volume was kept at 200 µl; fractions of 300 µl were
collected and analyzed by western blotting using polyclonal anti-IN antibodies. When
necessary, gel filtration fractions were concentrated by precipitation with trichloroacetic
acid. Stokes radii (Rs) and approximate MWs of the IN complexes were determined from
their experimental partition coefficients (Kav) as described (Siegel and Monty, 1965).
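
The calibration step in Example 5 can be sketched in Python as follows. This is a minimal illustration, assuming the commonly used linear relation between sqrt(-log10 Kav) and the Stokes radius, with Kav = (Ve - V0)/(Vt - V0). The text does not report the elution volumes of the standards, the void volume or the total column volume, so every volume below is synthetic and the function names are hypothetical; only the Stokes radii of the standards are taken from the text.

```python
import math


def kav(ve, v0, vt):
    """Partition coefficient: (Ve - V0) / (Vt - V0)."""
    return (ve - v0) / (vt - v0)


def transformed(ve, v0, vt):
    """sqrt(-log10(Kav)), treated here as linear in the Stokes radius."""
    return math.sqrt(-math.log10(kav(ve, v0, vt)))


def fit_calibration(standards, v0, vt):
    """Least-squares line y = intercept + slope * Rs.

    standards: list of (Stokes radius in angstrom, elution volume in ml).
    """
    xs = [rs for rs, _ in standards]
    ys = [transformed(ve, v0, vt) for _, ve in standards]
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    slope = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / sum(
        (x - mean_x) ** 2 for x in xs
    )
    return mean_y - slope * mean_x, slope


def stokes_radius(ve, v0, vt, intercept, slope):
    """Invert the calibration line for an unknown elution volume."""
    return (transformed(ve, v0, vt) - intercept) / slope


# --- Synthetic demonstration: all volumes and line parameters are made up ---
V0, VT = 8.0, 24.0                      # hypothetical void/total volumes (ml)
TRUE_INTERCEPT, TRUE_SLOPE = 0.1, 0.01  # hypothetical calibration line


def synthetic_ve(rs):
    """Elution volume that the hypothetical line would give for radius rs."""
    y = TRUE_INTERCEPT + TRUE_SLOPE * rs
    return V0 + 10 ** (-(y ** 2)) * (VT - V0)


# Stokes radii (angstrom) of the standards listed in Example 5.
STANDARD_RADII = [85.0, 61.0, 52.2, 48.1, 35.5, 20.9]
standards = [(rs, synthetic_ve(rs)) for rs in STANDARD_RADII]

intercept, slope = fit_calibration(standards, V0, VT)
print(round(stokes_radius(synthetic_ve(41.0), V0, VT, intercept, slope), 3))
```

With real data, the measured elution volumes of the standards and the measured V0 and Vt would replace the synthetic values; the fit and inversion steps stay the same.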

Example 6: Western Blotting and Immunoprecipitation

The gradient 4-12 and 4-20% Novex Tris-Glycine gels were purchased from Invitrogen.
Proteins were transferred onto PVDF membrane (Bio-Rad, Nazareth, Belgium); detection
was done with ECL+ (Amersham-Pharmacia, Roosendaal, The Netherlands). The rabbit
polyclonal anti-HIV-1 IN antibodies were home made and used at a final dilution of 1:30,000
(Cherepanov et al., 2000). The anti-FLAG M2 monoclonal antibody was from Sigma-Aldrich,
the monoclonal anti-DNA-PKcs Ab-4 cocktail from NeoMarkers (Fremont, CA)
and the monoclonal anti-LEDGF p75/p52 from BD Biosciences (Erembodegem,
Belgium). The affinity purified anti-hMCM3 polyclonal antibody was a gift from Dr.
R. Knippers (University of Konstanz, Germany). A combination of prestained MW
markers (New England Biolabs, Hertfordshire, UK) and Mark12 (Invitrogen) was used to
estimate MWs of the cross-linking products and Inip76. DNA-PKcs detected in a 293T
nuclear lysate sample using the anti-DNA-PKcs Ab-4 antibody served as the 470 kDa
marker in some western blots.

In the initial immunoprecipitation experiments, 30 µl of protein G agarose
(Roche) and 1-3 µg of the anti-FLAG M2 antibody were added to the nuclear extracts
prepared in 400CSK and diluted to obtain a total protein concentration of 200 µg/ml; the
suspension was stirred at 4°C overnight (12-18 hours). The agarose beads were washed
once with 400CSK and 4 times with 100CSK plus 0.1% NP40. Protein was eluted in
400CSK buffer by addition of 200 µg/ml FLAG peptide (Sigma-Aldrich) or in
SDS PAGE sample buffer. To purify LEDGF-INf complexes, immunoprecipitation was
carried out using undiluted nuclear extracts (700 µg/ml total protein) for 3-5 hours. To
identify the Inip76 protein by N-terminal sequencing and mass spectrometry, the
procedure was upscaled. 293T-IN*alaFLAG cells were grown to confluency on five
500 cm2 dishes (VWR International, Leuven, Belgium) and lysed with 0.5% NP40. IN
complexes were extracted from the nuclear pellets into 13 ml of the 400CSK buffer and
incubated with 300 µl of protein G agarose beads and 40 µg of the anti-FLAG M2
antibody for 4.5 hours. The INf complexes were eluted in 700 µl of the 400CSK buffer
with 200 µg/ml FLAG peptide.

Example 7: Amino Terminal Sequencing and Mass Spectrometry

Immunopurified INf-Inip76 complexes were precipitated with trichloroacetic acid and
redissolved in the SDS PAGE sample buffer. Approximately 3 µg of the Inip76 protein,
electroblotted onto a Sequi-Blot PVDF membrane (Bio-Rad) from an SDS PAGE gel,
was subjected to Edman degradation on a pulsed liquid phase Procise 491cLC protein
sequencer (Applied Biosystems, Lennik, Belgium). For mass spectrometry analysis the
Coomassie blue-stained band of Inip76 was cut from an SDS PAGE gel, destained in
200 mM ammonium bicarbonate/50% acetonitrile, air-dried and soaked in 6 µl trypsin
solution (16 ng trypsin (Promega) in 50 mM ammonium bicarbonate) on ice for 20 min.
Following overnight digestion at 37°C, the supernatant was recovered, and the gel slice
was extracted twice using 60% acetonitrile/0.1% formic acid. The extracts and the
supernatant were pooled and dried in a Speedvac concentrator. The peptides were
re-dissolved in 0.1% formic acid and analysed by on-line nanoflow high performance liquid
chromatography tandem mass spectrometry (LC/MS/MS) on an UltiMate capillary LC
system (LC-Packings, Amsterdam, The Netherlands) coupled to a Q-Tof mass
spectrometer (Micromass, Manchester, UK) equipped with an electrospray ionization
source. Technical details of this system are reported elsewhere (Devreese et al., J.
Chromatography A, in press). All spectra were processed using the MassLynx and
MaxEnt software delivered with the mass spectrometer.

Example 8: Indirect Immunofluorescence Microscopy

293T-IN*alaFLAG cells grown in Lab-Tek II glass chamber slides (VWR International)
were fixed with 4% formaldehyde in phosphate buffered saline (PBS) for 10 min and
permeabilized in ice-cold methanol. The cells were further blocked in PBS supplemented
with 20 mM ammonium chloride and 10% FCS and incubated with rabbit polyclonal
anti-FLAG antibodies (diluted 1:10,000 in PBS/10% FCS) (Sigma-Aldrich) and
monoclonal anti-LEDGF (1:300) or anti-DNA-PKcs (1:300) followed by Alexa-555
anti-rabbit and Alexa-488 conjugated anti-mouse IgG antibodies (Molecular Probes, Leiden,
The Netherlands). The nuclear DNA was labeled with 5 µM ToPro-3 iodide (Molecular
Probes). Confocal laser scanning fluorescent microscopy and imaging was carried out with
an LSM510 system (Carl Zeiss, Zaventem, Belgium) using a 488 nm Ar ion laser with a
505-530 nm band pass filter for Alexa-488, a 543 nm HeNe laser with a 565-615 nm
filter for Alexa-555 and a 633 nm HeNe laser with a low pass 650 nm filter for ToPro-3.
All acquisitions were done in the multi-track mode.

Example 9: Recombinant proteins

The His6-tagged HIV-1 IN was produced from the pRP1012 plasmid in the Endo I-free
host Escherichia coli strain PC1 (BL21(DE3), ΔendA::TcR, pLysS) (Cherepanov et al.,
1999). The protein was purified from the soluble fraction by chromatography on Ni-NTA
agarose (Qiagen, Hilden, Germany) and Heparin Sepharose (Amersham Pharmacia,
Uppsala, Sweden) in the presence of 7.5 mM
3-[(3-cholamidopropyl)dimethylammonio]-1-propanesulfonate (CHAPS) (Sigma-Aldrich). The
His6-tag was removed by incubation of the purified protein with thrombin (Novagen).
The His6-tagged Inip76 was expressed from the plasmid pCP6H75 in PC1 by induction
with 1 mM isopropylthiogalactopyranoside (IPTG) at 29°C for 3 hours in LB medium.
The soluble protein was then purified by batch adsorption to Ni-NTA agarose and
chromatography on a 1 ml HiTrap Heparin Sepharose column (Amersham Pharmacia).
The protein was eluted from the Heparin Sepharose column using a linear NaCl gradient
in 30 mM Tris-HCl [pH 7.0]. Peak fractions collected at approximately 800 mM NaCl
were pooled and concentrated using Centricon-30 (Millipore, Brussels, Belgium) to
obtain the final protein concentration of 1 mg/ml. The protein was estimated to be at least
95% pure by SDS PAGE.


Results 

Example 10: HIV-1 IN is present in the insoluble nuclear fraction

The 293T-IN*ala cell line used in this work was similar to the previously reported
293T-IN*, except that it expresses HIV-1 IN with the Met-Ala dipeptide at its N-terminus
(instead of Met-Gly) (Cherepanov et al., 2000). This change was introduced to prevent
potential myristoylation of the protein (Boutin, 1997) and affected neither cell line
stability nor IN expression levels. The integrase protein was nuclear in both cell lines as
determined by indirect immunofluorescence microscopy (see also Cherepanov et al., 2000).
After centrifugation of the digitonin- or NP40-treated 293T-IN*ala cells, most of the IN
protein was retained in the nuclear pellet (Figure 1A). Since NP40 permeabilizes both the
plasma membrane and the nuclear envelope, the bulk of IN present in the cell is thus
stably associated with insoluble nuclear structures. IN was extracted from the nuclei by
increased salt concentrations; 350-500 mM NaCl was sufficient to elute the protein
(Figure 1B). Therefore, buffers supplemented with 400 mM NaCl were used for
preparing nuclear salt extracts.

Some chromatin-associated proteins can be eluted from detergent-permeabilized
nuclei by gentle treatment with DNase I or micrococcal nuclease (Fujita et al., 1997;
Holthoff et al., 1998; Meller and Fisher, 1995). We observed no elution of IN when
nuclei prepared from 293T-IN*ala cells were treated with DNase I (Figure 1C), which
completely digested nuclear DNA to fragments of less than 200 bp. In accordance with
Fujita et al. (Fujita et al., 1997), MCM3, a chromosomal replication factor, was removed
from the nuclei by gentle DNase I treatment (Figure 1C). The same salt concentration
was required to extract IN from the nuclease-digested nuclei. However, subjection of the
DNase-digested nuclei to low ionic strength conditions led to efficient elution of the
protein (Figure 1D). Non-digested nuclei did not release any detectable amount of IN in
the hypotonic conditions (Figure 1D). These results indicate that IN is associated both
with chromosomal DNA and with some other nuclear structure, which is destabilized in
low ionic strength conditions. Similar results were obtained when 293T-IN*ala nuclei
were exposed to micrococcal nuclease.

Example 11: Chemical Cross-linking of IN Complexes Present in Nuclear Extracts

We used 3,3'-dithiobis[sulfosuccinimidyl propionate] (DTSSP), an amine-specific
N-hydroxysuccinimide ester with a 12 Å spacer arm, to cross-link protein complexes present
in the nuclear extracts of 293T-IN*ala cells. The nuclear proteins were extracted from the
NP40-treated 293T-IN*ala nuclei using the cytoskeleton buffer containing 400 mM NaCl
(400CSK). The total protein concentration was adjusted to 100, 20 and 4 µg/ml; the
samples were incubated with DTSSP and separated in a non-reducing 4-12% SDS PAGE
gel. IN-containing cross-linking adducts were detected by western blotting using
polyclonal anti-IN antibodies. A typical result is shown in Figure 2A. In the
non-cross-linked samples (lanes 2, 6 and 10) as well as in the samples cross-linked in the presence
of SDS (lane 1) only IN monomer and a band corresponding to IN dimer (approximately
60 kDa) were apparent. Addition of 0.1-2 mM DTSSP yielded cross-linked complexes of
60 kDa (p60cl), 150-180 kDa (the p150cl band, clearly visible in lanes 7, 8 and 12),
250-350 kDa (p300cl, lanes 7 and 8) and less resolved higher molecular weight (MW) species.
Strikingly, detection of the cross-linked IN complexes with our polyclonal anti-IN
antibodies was far more sensitive than detection of the non-cross-linked IN (compare
lanes 2 and 4, for example). Probably, some strong conformational epitopes are better
preserved within cross-linked IN during SDS PAGE and western blotting. No unspecific
bands were revealed in nuclear extracts from 293T cells before and after cross-linking
with DTSSP, confirming that all bands detected in the western blot correspond to
IN-containing complexes. The cross-linking of the 293T-IN*ala nuclear extract was clearly
dependent on the concentration of both DTSSP and protein. Saturating cross-linking of
diluted protein samples followed by denaturing SDS PAGE has been used to determine
MWs of native protein complexes (see Corey et al., 1998 and references therein).
Cross-linking of the diluted nuclear extract (4 µg/ml protein) with 2 mM DTSSP yielded
a band of approximately 150 kDa (p150cl, lane 16), while in more concentrated extracts
(100 µg/ml) a diffuse band of 250-300 kDa was the most prominent (p300cl, lane 8).
Thus, there exist at least two different IN complexes: a large complex at higher protein
concentrations and a smaller complex in the diluted extract. The p150cl product seems to
be the result of complete cross-linking since no significant change in cross-linking occurred
when the concentration of DTSSP was increased from 0.5 to 2 mM (compare lanes 15
and 16) and higher (not shown). Thus, p150cl probably represents the IN complex present
in the diluted nuclear extract. Moreover, this complex is a dissociation product and,
importantly, a component of the larger complex, since it appeared as a partial
cross-linking adduct in the reactions with more concentrated protein extracts at 0.5 mM DTSSP
and it decreased at 2 mM DTSSP (compare lanes 7 and 8). However, the p300cl band is
probably not the result of complete cross-linking of the larger complex, since a strong
smear and some less resolved bands are present above p300cl on the western blot;
aggregated material not able to enter the 4% acrylamide gel is also evident (lanes 8 and 12).
Some of the high MW adducts in the reactions with 100 µg/ml extracts may result from
non-specific inter-molecular cross-linking of proteins. Cross-linking of IN complexes in
the nuclear extracts using the oxidizing Cu2+-[1,10-phenanthroline]3 complex (a Cys-Cys
cross-linker (Ji, 1983)) was also suggestive of the presence of a large protein complex that
dissociated upon dilution, releasing a molecule of approximately 150 kDa.

Example 12: Apparent Stokes Radii of the two Nuclear Integrase Complexes

To confirm the presence of both IN complexes and deduce their size we used gel
filtration. Undiluted (550 µg/ml protein) and diluted (30 µg/ml) nuclear salt extracts from
293T-IN*ala cells were run on a calibrated Superdex 200 column and the IN elution was
followed by immunoblotting the collected fractions (Figure 3A). We observed two
distinct elution volumes corresponding to two different IN complexes. After
chromatography of the undiluted extract, IN eluted symmetrically with a peak maximum
in fractions 8 and 9 (elution volume, Ve = 11.2 ml; Stokes radius, Rs = 61 Å, see Figure
3B). However, IN eluted later and in a broader peak with a Ve of 14.1 ml (Rs = 41 Å,
Figures 3A and 3B) when the sample was diluted prior to gel filtration. Assuming both
complexes are globular, their MWs can be calculated to be 380 kDa and 115 kDa
respectively (Figure 3C). The smaller dilution-resistant molecule (Rs = 41 Å) most likely
corresponds to the p150cl cross-linked complex observed in the previous experiment,
while partial cross-linking of the 61 Å IN complex probably resulted in p300cl. The
effective dilution of the peak fractions on the column was 5- to 7-fold as calculated from
calibration runs. Thus, the sample concentrations used in our gel filtration experiments
corresponded to those used in the cross-linking experiments (100 µg/ml and 4 µg/ml).
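
The correspondence between the gel filtration and cross-linking concentrations can be checked with simple arithmetic. The short Python sketch below divides the loaded concentrations (550 and 30 µg/ml) by the reported 5- to 7-fold on-column dilution; the function name is hypothetical and the calculation is only an illustration of the reasoning, not part of the original analysis.

```python
def on_column_range(loaded_ug_per_ml, fold_min=5.0, fold_max=7.0):
    """Return (lowest, highest) peak concentration after on-column dilution."""
    return loaded_ug_per_ml / fold_max, loaded_ug_per_ml / fold_min


# Loaded concentrations (ug/ml) from Example 12.
for loaded in (550.0, 30.0):
    low, high = on_column_range(loaded)
    print(f"loaded {loaded:.0f} ug/ml -> about {low:.1f} to {high:.1f} ug/ml on the column")
```

The undiluted extract therefore passes through the column at roughly 80-110 µg/ml and the diluted extract at roughly 4-6 µg/ml, which is close to the 100 µg/ml and 4 µg/ml used for cross-linking.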

Example 13: Purification and Characterization of FLAG-Tagged IN Complexes

To facilitate isolation of native IN complexes from cell extracts we modified the IN
expression construct, adding the FLAG epitope tag at the carboxy terminus of IN. The
293T-IN*alaFLAG cell line, obtained by stable transfection of 293T cells with the tagged
expression construct, was very similar to 293T-IN*ala in stability and levels of IN
expression (not shown). FLAG-tagged IN (INf) localized predominantly in the nuclei in a
diffuse pattern and was associated with chromosomes during mitosis (see below) as has
been previously reported for non-tagged HIV-1 IN (Cherepanov et al., 2000). INf could
be extracted from the nuclei of 293T-IN*alaFLAG cells in the same conditions as for
non-tagged IN. The cross-linking pattern of INf with DTSSP was very similar to that of
non-tagged IN (Figure 2B). The two major cross-linking products of INf showed slightly
slower migration in SDS PAGE gels than the original p150cl and p300cl, which can be
attributed to the negative charge of the FLAG tag and increased molecular weight of the
tagged protein. For convenience, however, we refer to the INf cross-linking adducts as
p150cl and p300cl. The gel filtration profiles were as observed for the non-tagged IN
extracted from 293T-IN*ala cells (see below).

In initial immunoprecipitation experiments, we incubated diluted nuclear extracts
from metabolically labeled 293T-IN*alaFLAG cells with the anti-FLAG M2 antibody and
protein G agarose overnight. The protein isolated in this way displayed a single specific
band in SDS PAGE gels migrating at the expected position for the FLAG-tagged IN
(33.5 kDa) (Figure 4A). Isoelectrofocusing of immunoprecipitated INf in denaturing pH
gradients showed a major band close to the predicted pI (6.5), which reacted with anti-IN
serum in immunoblot. When the INf immunoprecipitated from a nuclear extract of
293T-IN*alaFLAG cells was eluted from the anti-FLAG M2 antibody with synthetic FLAG
peptide and incubated with DTSSP, the p150cl cross-linking product was readily obtained
(Figure 4B). When higher INf concentrations were used in cross-linking, the
immunoreactive reaction products accumulated at the top of the gel, suggesting
aggregation of the protein. We were not able to find reaction conditions to reproduce the
p300cl cross-linking product with INf preparations purified this way. Fractionation of
purified INf on a Superdex column showed a peak with a Kav value very close to that of
the 41 Å complex (Figure 4C). The fact that the 41 Å complex exists in the purified INf
preparation and the apparent MW (115-150 kDa), based on cross-linking and gel
filtration experiments, suggest that the 41 Å molecule is a tetramer of IN. Apparently, the
native 61 Å IN complex was not stable enough to withstand immunoprecipitation under
these conditions. Next, we tried shorter immunoprecipitation times (3-5 hours) starting
from more concentrated nuclear extracts. Although the overall yield of INf decreased, the
immunoprecipitated samples were found to contain an additional protein. It had an
apparent MW of approximately 76 kDa, as determined by SDS PAGE (Figure 5A) and
was present at variable ratios to INf in different preps. This protein, here referred to as
Inip76 (for 76 kDa IN interacting protein), is specifically associated with INf, since it
could not be immunoprecipitated from the parental 293T cells with the anti-FLAG
antibody (compare lanes 2 and 3 in Figure 5A). When undiluted nuclear extract from
293T-IN*alaFLAG cells was immunoprecipitated with anti-FLAG antibody for 4 hours
both INf and Inip76 bands were readily detected (Figure 5B). Although extending
immunoprecipitation to 16 hours improved INf recovery, the yields of Inip76 were greatly
reduced (compare lanes 1 and 2 in Figure 5B). Intriguingly, the p300cl band, detected
after DTSSP cross-linking of the nuclear salt extracts, was readily observed when the
Inip76-containing INf preparations were cross-linked with DTSSP (Figure 6E),
suggesting that Inip76 is part of a large IN complex present in the nuclear extracts.

Example 14: Identification of the Inip76 Protein as LEDGF/DFS70/p75
By upscaling immunoprecipitation we were able to isolate sufficient amounts of Inip76
for characterization by Edman degradation and mass spectrometry (Figure 6A). The
amino terminal sequence obtained from Inip76 was XXKDFKPGD (the first two residues
were not resolved due to background noise). Scanning the TrEMBL protein database for
human proteins carrying this sequence tag (Wilkins et al., 1996) resulted in four hits with
accession numbers O95368, Q9UER6, Q9NZI3 and O75475, all corresponding to the two
alternative products of one gene: LEDGF/DFS70/p75 (lens epithelium-derived growth
factor, referred to hereafter as LEDGF) and the p52 protein (Ge et al., 1998; Singh et al.,
2000). Although the actual MW of LEDGF is approximately 60 kDa, it has been
reported to migrate as a 75 kDa band in SDS PAGE gels (Ge et al., 1998). On-line liquid
chromatography tandem mass spectrometry analysis (LC/MS/MS) of tryptic peptides
obtained by in-gel digestion of Inip76 provided further evidence that Inip76 is indeed
identical to LEDGF. LEDGF, a member of the hepatoma-derived growth factor family,
is found at low levels in many actively dividing and long-lived cells. Its gene yields two
proteins, LEDGF/p75 and p52, by alternative splicing (Ge, H., Si, Y., and Roeder, R. G.
(1998) EMBO J. 17, 6723-6729 and 2). LEDGF belongs to a family of homologous
proteins including hepatoma-derived growth factor (HDGF) (Nakamura, H, Izumoto, Y,
Kambe, H, et al (1994) Molecular cloning of complementary DNA for a novel human
hepatoma-derived growth factor: its homology with high mobility group-1 protein J Biol
Chem 269, 25143-25149) and HDGF-related protein-1 and -2 (Izumoto, Y, Kuroda, T,
Harada, H, Kishimoto, T, Nakamura, H. (1997) Hepatoma-derived growth factor belongs
to a gene family in mice showing significant homology in the amino terminus Biochem
Biophys Res Commun 238, 26-32).
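
The database scan with the Edman tag can be pictured as a pattern match in which each unresolved cycle (X) accepts any residue. The following is only a minimal sketch of that idea: the function names and the three toy entries are hypothetical, the toy sequences are built from the tag and the 4-14 peptide of Table 1 with two placeholder residues in front, and no real TrEMBL records or search tool are reproduced.

```python
import re

def tag_to_regex(tag):
    """Turn an Edman tag into an N-terminal regex; 'X' matches any residue."""
    body = "".join("[A-Z]" if aa == "X" else re.escape(aa) for aa in tag.upper())
    return re.compile("^" + body)

def n_terminal_hits(tag, database):
    """Return the names of entries whose N-terminus matches the tag."""
    pattern = tag_to_regex(tag)
    return [name for name, seq in database.items() if pattern.match(seq)]

# Toy entries (hypothetical, for illustration only).
TOY_DATABASE = {
    "toy_hit": "AAKDFKPGDLIFAK",      # tag fits at the N-terminus
    "toy_offset": "MAAKDFKPGDLIFAK",  # same motif, shifted by one residue
    "toy_miss": "AAKDFRPGDLIFAK",     # one mismatch inside the tag
}

hits = n_terminal_hits("XXKDFKPGD", TOY_DATABASE)
print(hits)
```

Anchoring the pattern at the first residue reflects that Edman degradation reads from the amino terminus; a real search would also have to allow for processed N-termini.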

Half of the predicted LEDGF tryptic peptides within the mass range of 1000-2500 Da
could be identified in the sample, and their MS/MS spectra readily matched LEDGF,
covering approximately 18% of its sequence (Figure 6B and Table 1). Moreover, Inip76
strongly reacted with a commercially available monoclonal anti-LEDGF antibody (see
below). Most of the INf present in nuclear extracts could be immunoprecipitated with the
anti-LEDGF antibody, whereas only about 10% of LEDGF could be recovered with the
anti-FLAG antibody (Figure 6C), suggesting that LEDGF is present in excess over INf
in the extract. Non-tagged IN could also be efficiently precipitated with the anti-LEDGF
antibody, and a fraction of the LEDGF could be precipitated with polyclonal anti-IN
antibody under similar conditions from nuclear salt extracts prepared from 293T-INsala cells.
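
The coverage figure of approximately 18% can be checked from the residue ranges listed in Table 1 and the 530-residue length given for p75 in the Discussion. The sketch below is illustrative only; the variable and function names are assumptions.

```python
# Residue ranges of the matched tryptic peptides, as listed in Table 1.
PEPTIDE_RANGES = [(4, 14), (40, 56), (57, 67), (76, 89),
                  (136, 143), (156, 175), (425, 442)]
P75_LENGTH = 530  # p75 length in amino acids, as stated in the Discussion

def covered_residues(ranges):
    """Count distinct residue positions covered by inclusive ranges."""
    covered = set()
    for start, end in ranges:
        covered.update(range(start, end + 1))
    return len(covered)

n_covered = covered_residues(PEPTIDE_RANGES)
coverage_percent = 100.0 * n_covered / P75_LENGTH
print(f"{n_covered} of {P75_LENGTH} residues covered ({coverage_percent:.1f}%)")
```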
Example 15: Inip76 is part of the 61 Å HIV IN complex

To determine whether the 61 Å complex contains Inip76/LEDGF, we pre-incubated the
nuclear salt extract from 293T-INsalaFLAG cells with a monoclonal anti-LEDGF
antibody prior to chromatography on a Superdex column. The INf elution profile changed
dramatically: the peak now eluted near the void volume of the column (Figure 6D).
Elution of the 61 Å complex was not altered by pre-incubation of the extract with an
unrelated mouse IgG1 (Figure 6D). Predictably, elution of the 41 Å IN complex (the
presumed IN tetramer) did not change after pre-incubation of the diluted nuclear extracts
with the anti-LEDGF antibody.

When the INf-Inip76 complex was purified by immunoprecipitation and cross-linked
with DTSSP, the p300cl band could be readily detected in an immunoblot with anti-IN
antibody (Figure 6E). However, p300cl did not react with a monoclonal anti-LEDGF
antibody; instead, a western blot with the anti-LEDGF antibody revealed two bands
migrating at higher positions in the gel (pHMW1cl and pHMW2cl, Figure 6E) (the MWs of
these molecules are too high to be determined from SDS PAGE). These results suggest
that p300cl is a product of incomplete cross-linking of the 61 Å INf-Inip76 complex. Being
composed of IN alone, p300cl probably represents an octamer of IN (a dimer of
tetramers). Contacts between Inip76 and IN within the 61 Å complex may be less prone
to cross-linking with DTSSP than those between IN protomers. We cannot, however,
exclude the possibility that the target epitope for the monoclonal anti-LEDGF antibody
used is masked or destroyed within p300cl. In addition to the major p300cl product, a band
at a position close to pHMW1cl is present on the anti-IN immunoblot of the purified and
cross-linked INf-Inip76 complex (Figure 6E, lane 2). Thus, the pHMW1cl adduct is
probably the smallest cross-linked IN complex containing Inip76.



Example 16: Inip76 co-localizes with IN within nuclei of 293T-INsalaFLAG cells
In mammalian cells, LEDGF is expressed as a nuclear protein; it is distributed in a
diffuse manner throughout the nucleus during interphase and is stably associated with
condensed chromosomes throughout metaphase and anaphase (Nishizawa et al., 2001). A
similar subcellular distribution has been reported for HIV-1 IN (Cherepanov et al., 2000).
Immunofluorescent detection of both INf and Inip76/LEDGF in fixed 293T-INsala cells
revealed strikingly similar intranuclear distribution patterns for both proteins (Figure 7A).
In accordance with previous reports, both proteins were bound to condensed
chromosomes in mitotic cells (Figure 7B). The distribution of another nuclear protein, the
catalytic subunit of DNA-dependent protein kinase (DNAPKcs), clearly differed from
that of INf (Figure 7C). In addition, DNAPKcs was excluded from condensed
chromosomes in mitotic cells. Intriguingly, the nuclear localization of INf and Inip76 did
not correspond to the overall DNA staining pattern, arguing against the possibility that
the apparent co-localization of the two proteins might be merely due to their independent
association with chromosomal DNA. In a control experiment, we visualized INf using a
mixture of polyclonal and monoclonal anti-FLAG antibodies; the resulting two-color INf
staining was similar to that of INf and Inip76.

Example 17: Inip76 is an activator of HIV-1 IN in vitro

We have previously described activities of recombinant HIV-1 IN on the mini-HIV
substrate, a linear 4.7 kb double-stranded DNA molecule carrying the U3 and U5
terminal fragments of the viral LTR sequences (Cherepanov et al., 1999). Recombinant
HIV-1 IN by itself was proficient in carrying out cleavage and strand transfer using this
long DNA substrate; however, the presence of 5-12% polyethylene glycol (PEG) in the
reaction was required for the enzymatic activity. To ascribe a possible function to the
observed Inip76-IN interaction, we examined whether Inip76 could modulate enzymatic
activity of HIV-1 IN in vitro. The mini-HIV substrate was incubated with recombinant IN
and His6-tagged Inip76 in the absence of PEG, and the reaction products were analyzed
by native agarose gel electrophoresis (Figure 6F). In the absence of Inip76, strand transfer
products were almost undetectable (lane 3). Addition of Inip76 resulted in a potent
stimulation of the reaction. In some conditions, approximately half of the substrate DNA
was converted into various reaction products, including those that were too large to enter
the gel (lanes 7 and 8). Remarkably, both the overall efficiency of the reaction and the
range of the strand transfer products depended on the concentration of Inip76. No
significant variation in the yield of the strand transfer products was detected when the
order of addition of Inip76 and IN to the mini-HIV reaction was reversed. Although
addition of Inip76 to IN reactions containing short oligonucleotide substrate stimulated
the strand transfer activity, the presence of PEG and DMSO was still essential for efficient
Mg2+-dependent activity of our recombinant enzyme preparation (Cherepanov et al.,
1997).
Discussion



The minimal nuclear HIV-1 IN complex is a homotetramer

A serious obstacle in working with recombinant retroviral integrases is their poor
solubility and propensity for aggregation. When produced in bacteria, HIV IN has to be
extracted from inclusion bodies; detergents are used to stabilize the enzyme in solution.
All crystal structure and some in vitro multimerization studies have been done with the
soluble mutants. Furthermore, recent reports raised concern that the stoichiometry and
enzymatic activities of recombinant IN can be affected by the enzyme preparation (Leh et
al., 2000; Sinha et al., 2002). Our goal was to study protein complexes that HIV-1 IN
forms within nuclei of human cells. The bulk of HIV-1 IN present in 293T cells that
stably produce this viral protein is associated with the insoluble nuclear fraction.
Although IN seems to be directly or indirectly bound to chromosomal DNA, this may not
be the only factor in nuclear retention of IN, since digestion of the detergent-permeabilized
nuclei with nucleases was not sufficient to elute IN. In this work, we
concentrated on the study of IN complexes extracted from the detergent-permeabilized
nuclei in hypertonic conditions. We found that salt-eluted IN exists as part of a distinct
61 Å complex, which is not stable in diluted nuclear extracts and dissociates, releasing a
41 Å core molecule. Our cross-linking and gel filtration data suggest that the latter
molecule is a tetramer of IN. Indeed, the complex exists in immunoprecipitated INf
preparations. Its molecular weight determined both by gel filtration and cross-linking
(115-150 kDa) is close to the value expected for the presumed tetramer (128 kDa).
Finally, partial cross-linking of the diluted nuclear extracts and of the purified tagged IN
protein produced bands at the positions corresponding to the IN monomer, dimer and
trimer, in addition to p150cl (Figure 2A, lane 7). We estimate that the concentrations of
IN and INf in the nuclear extracts did not exceed 10 nM (< 0.3 µg/ml IN at 100 µg/ml
total protein). Therefore, the IN tetramer was stable even at subnanomolar concentrations
(i.e. in the extracts diluted to 4 µg/ml of total protein), implying that the minimal nuclear
IN complex is a tetramer.
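
The arithmetic behind these estimates can be retraced in a few lines. This is a sketch under stated assumptions: it takes the 32 kDa IN monomer size quoted in the Figure 1 legend, assumes that dilution scales the IN concentration in proportion to total protein, and uses hypothetical helper names.

```python
IN_MONOMER_KDA = 32.0  # untagged IN monomer, as quoted in the Figure 1 legend

def oligomer_mass_kda(n_subunits, monomer_kda=IN_MONOMER_KDA):
    """Expected mass of an n-subunit IN oligomer."""
    return n_subunits * monomer_kda

def nanomolar_to_ug_per_ml(conc_nm, mass_kda):
    """1 nM of a 1 kDa species is 1e-3 ug/ml, so scale by both factors."""
    return conc_nm * mass_kda * 1e-3

def diluted_concentration_nm(conc_nm, protein_before, protein_after):
    """Assume IN is diluted in proportion to total protein (same units for both)."""
    return conc_nm * protein_after / protein_before

for n, name in [(1, "monomer"), (2, "dimer"), (3, "trimer"), (4, "tetramer")]:
    print(f"{name}: {oligomer_mass_kda(n):.0f} kDa")

print(nanomolar_to_ug_per_ml(10, IN_MONOMER_KDA))  # mass concentration at 10 nM IN
print(diluted_concentration_nm(10, 100, 4))        # nM after dilution to 4 ug/ml
```

Under these assumptions, 10 nM IN corresponds to roughly 0.3 µg/ml, and dilution from 100 to 4 µg/ml of total protein brings IN below 1 nM, in line with the text.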


Composition of the 61 Å HIV-1 IN complex
The native 61 Å IN complex was found to decay during prolonged incubations and not to
be stable during purification. When the immunoprecipitation time was shortened and
higher initial extract concentrations were used, a cellular protein with an apparent MW of
76 kDa specifically co-precipitated with the epitope-tagged IN. The protein, or Inip76, as
we refer to it, was found to be identical to human LEDGF/DFS70/p75 protein. We further
confirmed that Inip76/LEDGF is a part of the native 61 Å IN complex, since addition of a
monoclonal anti-LEDGF antibody shifted elution of the complex from a gel filtration
column. Importantly, when LEDGF protein was present, the p300cl band became visible
after DTSSP cross-linking of the immunoprecipitated INf. This band could not be
reproduced in LEDGF-negative INf samples obtained following the long
immunoprecipitation protocol. p300cl seems to be the result of partial cross-linking of the
61 Å IN complex for the following three reasons: (i) the MW was lower than the
expected value for a 61 Å globular complex (≈400 kDa); (ii) DTSSP cross-linking
products with apparent MWs higher than that of p300cl were visible on anti-IN
immunoblots; (iii) this product did not react with anti-LEDGF antibody. Based on its
apparent molecular weight, we speculate that p300cl probably represents two cross-linked
IN tetramers.

What is the exact stoichiometry of the IN-Inip76 complex? At the concentrations at which
it is present in nuclear extracts, the 61 Å complex was not stable enough to allow
measurement of its sedimentation coefficient, which is required to determine its precise
MW (Siegel and Monty, 1965). Assuming that the 61 Å complex is globular, we estimate
its MW to be around 400 kDa. The simplest model compatible with this MW is of a
symmetrical complex containing a pair of IN tetramers and two subunits of Inip76, which
corresponds to a macromolecule of 370 kDa. Reconstitution of the IN-Inip76 complex
from the recombinant proteins will help to confirm the proposed stoichiometry. At this
time, we cannot rule out the possibility that the native 61 Å complex contains another
cellular protein lost during immunoprecipitation. The purified Inip76-containing IN
samples displayed complex gel filtration profiles, probably due to partial dissociation of
the native complex.
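
To see how the proposed stoichiometry compares with alternatives, candidate compositions can be enumerated and ranked by their distance from the estimated 400 kDa. The sketch below assumes a 32 kDa IN monomer and the approximately 60 kDa actual MW of LEDGF quoted in Example 14; the search limits and function names are arbitrary choices made for illustration.

```python
IN_KDA = 32.0       # IN monomer (assumed from the Figure 1 legend)
LEDGF_KDA = 60.0    # approximate actual MW of LEDGF/Inip76 (Example 14)
TARGET_KDA = 400.0  # estimate for a globular 61 angstrom complex

def complex_mass_kda(n_in_tetramers, n_inip76):
    """Mass of a complex built from IN tetramers and Inip76 subunits."""
    return n_in_tetramers * 4 * IN_KDA + n_inip76 * LEDGF_KDA

def rank_models(max_tetramers=3, max_inip76=4):
    """List (deviation, tetramers, Inip76 subunits, mass), closest to target first."""
    models = []
    for t in range(1, max_tetramers + 1):
        for p in range(1, max_inip76 + 1):
            mass = complex_mass_kda(t, p)
            models.append((abs(mass - TARGET_KDA), t, p, mass))
    return sorted(models)

for deviation, t, p, mass in rank_models()[:4]:
    print(f"{t} tetramer(s) + {p} Inip76: {mass:.0f} kDa (off by {deviation:.0f})")
```

With these inputs the two-tetramer, two-Inip76 model comes to 376 kDa, close to the 370 kDa quoted above, and ranks nearest to 400 kDa. Other compositions are not far behind, so mass alone does not settle the stoichiometry.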


Retroviral IN within PIC: a dimer of dimers or a dimer of tetramers?
During reverse transcription, the two retroviral cDNA termini are not completed
simultaneously and both seem to be substrates for the 3'-processing activity of IN as soon
as they appear (Miller et al., 1997). Moreover, at least in the case of HIV, 3'-processing of
one LTR end was observed when the second end was non-functional and also did not
support normal intasome assembly (Chen and Engelman, 2001). On the other hand, two
functional LTRs were found to be required for strand transfer activity of isolated HIV
PICs. Therefore, although LTRs can be processed asymmetrically, a synaptic complex
involving both LTRs must form to allow strand transfer, ensuring that only legitimate
integration of both retroviral cDNA ends occurs. Based on a comparison to Mu phage
transposase and available crystal structure data, it has been suggested that the active form
of retroviral IN is a tetramer (a dimer of dimers) (Wang et al., 2001). Our results suggest
that HIV IN expressed in human cells is indeed present as a stable tetramer. Recombinant
HIV-1 IN has also recently been reported to form stable tetramers in diluted solutions
(Leh et al., 2000). Intriguingly, both Mu phage and Tn5 transposases form active
multimers (tetramers and dimers, respectively) only within their synaptic complexes. Thus,
independent transposase subunits must first bind to the ends of the transposon genome
before being brought together to form the synaptic complex. Extrapolating this scheme to
retroviral PIC assembly, the stable IN tetramers can be regarded as such independent
subunits, which first have to bind to one LTR end each before interacting with each other.
According to this model, each individual tetramer would be capable of carrying out the
3'-processing reaction, whereas a dimer of tetramers would be necessary to accomplish
strand transfer. The data presented here suggest that the cellular HIV-1 IN 61 Å complex
contains two IN tetramers that are close enough to be cross-linked to each other. It is
therefore very tempting to speculate that this dimer of tetramers may represent the
configuration of IN required for concerted integration.

What is the role of Inip76 in retroviral replication? 

The human Ini1/hSNF5/BAF47 protein has been shown to interact with HIV-1 IN in a
yeast two-hybrid screen and in vitro. We neither observed co-immunoprecipitation of
Ini1 with IN from nuclear extracts, nor were we able to detect co-localization of the two
proteins in the cell (to be published elsewhere). We speculate that the cellular IN-Ini1
interaction might be transient and occur only during certain steps of viral replication.
We have now shown that IN complexes extracted from the cell lines stably expressing
HIV-1 IN contained another human protein, Inip76, identical to LEDGF/DFS70/p75. INf
and Inip76 co-localized in the nuclei of 293T-INsalaFLAG cells, confirming that the
observed interaction was not an artifact of the extraction. So far, there has been no
account of a link between LEDGF/DFS70/p75 and retroviral replication. No
obvious relation seems to exist between this protein and known chromatin remodeling
factors. The protein was first described as the positive transcription cofactor
PC4-interacting protein (Ge et al., 1998). Two alternatively spliced cDNA clones were
isolated from a HeLa library coding for two proteins, p52 (333 amino acids, as predicted
from the cDNA) and p75 (530 amino acids), sharing identical 325 amino terminal residues.
The transcripts coding for p75 and p52 were detected in different cell types and tissues, p52
being most abundant in testis and p75 in thymus. Independently, a cDNA clone coding
for a protein identical to p75 has been isolated from a lens epithelium cell library (Singh
et al., 1999). Overexpression of the protein stimulated survival of diverse primary cells
and cell lines and enhanced their resistance to oxidative and hyperthermic stress (hence
LEDGF, "lens epithelium-derived growth factor"). The same protein has also been
identified as the DFS70 autoantigen, antibodies to which were found in some cases of
atopic dermatitis, asthma and interstitial cystitis (Ochs et al., 2000). One case of acute
myeloid leukemia with a chromosomal translocation resulting in a fusion of the NUP98
and LEDGF genes has been described (Ahuja et al., 2000).

Based on sequence similarity, Inip76/LEDGF/DFS70/p75 is a member of the
hepatoma-derived growth factor (HDGF) family that includes HDGF and several other
HDGF-related proteins (HRPs) (reviewed in Dietz et al., 2002). A high degree of
homology exists between the amino terminal regions of these proteins. The PWWP motif
(70 residues containing the Pro-Trp-Trp-Pro core sequence) is located within the amino
terminal homology region of HRPs and relates them to a larger and functionally diverse
nuclear protein family that includes DNA-binding transcription factors and enzymes
involved in DNA repair and DNA methylation. PWWP domains are thought to be
implicated in protein-protein interactions (Stec et al., 2000).
An eye-catching feature of Inip76 is the abundance of charged amino acids,
comprising approximately 40% of the total residues. Screening its sequence against the
BLOCKS+ protein motif database (Henikoff et al., 1999) revealed fragments with
similarity to the HMG-I(Y) DNA AT hook sequence. LEDGF has been shown to be a
DNA-binding protein with affinity for heat shock and stress-related DNA elements (HSEs
and STREs) (Singh et al., 2001). It has also been reported to interact with components of
the general transcription machinery and with the transcription activation domain of VP16
(Ge et al., 1998). Recent reports suggested that LEDGF plays an important role in
regulating expression of the stress-response genes (Fatma et al., 2001; Shinohara et al.,
2002). As mentioned above, alternative splicing of LEDGF pre-mRNA allows
expression of the second protein, p52, from the same gene. A growing body of evidence
suggests that p75 and p52 may have different functions. Although they both can interact
with PC4, VP16 and general transcription factors, at least in vitro, p52 displays higher
transcription activation activity (Ge et al., 1998). In addition, p52 and not p75 has been
shown to functionally interact with the ASF/SF2 splicing factor in vitro (Ge et al., 1998).
The proteins also differ in their nuclear distribution patterns (Nishizawa et al., 2001).
Intracellular levels of p52 appear to be much lower than those of p75, at least in the cell
lines we have tested (HEK-293, 293T, HeLa, CEM). We have not detected
co-immunoprecipitation of p52 with IN from the nuclear cell lysates of IN-expressing 293T
cells. However, it remains to be determined whether p52 is able to interact with HIV IN.

What is the role of the Inip76-IN interaction in the retroviral life cycle?

The fact that recombinant Inip76 was able to stimulate HIV-1 IN activity in vitro
suggests a direct involvement of Inip76 in the integration process. As a
chromosome-associated IN-binding protein, Inip76 may serve as a docking factor for the
PIC. It might thus be functionally similar to the Mu phage transposition co-factor MuB,
which, by associating with the acceptor DNA, makes it a preferred target for transposition.
It is also conceivable that Inip76 might be a part of the retroviral PIC. Experiments are
currently underway to establish whether Inip76 is an essential factor in HIV replication.
If proved to be essential, Inip76 will constitute a novel target for anti-retroviral therapy
directed against its interaction with IN. Alternatively, a therapeutic strategy based on a
modified Inip76 protein, designed to capture the viral IN in a catalytically quiescent
complex, may be envisaged.
Legends to the Figures

Figure 1. Extraction of IN from nuclei of 293T-INsala cells.

(A) 293T-INsala cells were lysed in 100CSK buffer in the presence of 4 ng/ml digitonin
or 0.5% NP40 on ice for 10 minutes. The supernatant (S) and nuclear pellet (P) fractions
were recovered and analyzed by western blotting with an anti-IN antibody. The first lane
contains the total cell extract.

(B) 293T-INsala cells were lysed in 100CSK buffer supplemented with 0.5% NP40 on ice
for 10 minutes, and the extracted nuclei were re-suspended in CSK buffer containing
0-500 mM NaCl. After centrifugation, supernatants (S) and nuclear pellets (P) were
analyzed by western blotting with an anti-IN antibody. The total nuclear protein was
loaded in the first lane (nuc).

(C) NP40-permeabilized nuclei from the 293T-INsala cells, prepared as above, were
incubated in 100CSK buffer with (+DNaseI) or without (-DNaseI) DNase I
(250 units/ml) at 25 °C for 10 (10') or 30 (30') minutes. Following centrifugation, the
supernatants (S) and pellets (P) were separated in an 11% SDS PAGE gel, the upper part
of which was used for the immunoblot to detect MCM3 (91 kDa) and the lower part to
detect IN (32 kDa). The total cytoplasmic and nuclear protein fractions were loaded in the
first (cyt) and the second (nuc) lanes, respectively.

(D) NP40-permeabilized nuclei from 293T-INsala cells were incubated in 100CSK buffer
with or without DNase I (250 units/ml) for 10 minutes, pelleted by centrifugation and
resuspended in ice-cold hypotonic buffer (2 mM EDTA, 2 mM Hepes [pH 7.5]). After
centrifugation, supernatants (S) and pellets (P) were analyzed by western blotting with
anti-IN antibodies. The first lane (nuc) contains total nuclear protein.


Figure 2. Cross-linking of the IN and INf complexes with DTSSP.

(A) The nuclear extract from 293T-INsala cells, prepared in the 400CSK buffer, was
cross-linked with DTSSP and separated in a non-reducing 4-12% SDS PAGE gel. The
IN-containing cross-linking adducts were detected by western blotting with a polyclonal
anti-IN antibody. Prior to cross-linking, the extract was adjusted to 100 µg/ml (lanes 1-5),
20 µg/ml (lanes 6-9) or 4 µg/ml (lanes 10-13) of total protein. The concentration of
DTSSP was 0.1 mM (lanes 3, 7, 11), 0.5 mM (lanes 1, 4, 8, 12) or 2.0 mM (lanes 5, 9,
13). No cross-linker was added to the samples in lanes 2, 6 and 10. The sample in lane 1
was cross-linked in the presence of 0.2% SDS.

(B) Cross-linking of the FLAG-tagged IN complexes was done under conditions similar to
those in (A). Only lanes containing samples cross-linked with 2 mM DTSSP are shown; the
sample in lane 1 was cross-linked in the presence of 0.2% SDS. The p300cl, p150cl, p60cl
and IN monomer bands and the positions of the MW markers are indicated. The ≈470 kDa
mark corresponds to the band of the catalytic subunit of DNA PK (MW 469 kDa), which
was detected in a separate lane with a monoclonal anti-DNA PKcs antibody.


Figure 3. Determination of sizes and molecular weights of the IN complexes. 

(A) Chromatography of nuclear extracts from 293T-INsala cells was carried out on a
calibrated Superdex 200 column. Prior to chromatography, the extract was adjusted to
550 µg/ml or 30 µg/ml of total protein; the collected fractions (1-26) were tested for the
presence of IN by western blotting. The elution volumes (Ve) and the respective partition
coefficients (Kav) for the observed IN peaks are indicated.

(B, C) Determination of the Stokes radii and approximate molecular weights of the IN
complexes from the experimental Kav values. The partition coefficients for the standard
proteins were determined in the same conditions (thyr., thyroglobulin, Kav = 0.039;
ferritin, Kav = 0.17; catalase, Kav = 0.26; aldolase, Kav = 0.28; BSA, bovine serum
albumin, Kav = 0.39; chymotr., chymotrypsinogen A, Kav = 0.59).
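
One common way to convert such Kav values into Stokes radii is a linear calibration of the radius against sqrt(-log Kav). The sketch below illustrates that approach and is not necessarily the procedure used for Figure 3. The Kav values are those listed in the legend, whereas the Stokes radii of the standards are commonly quoted calibration-kit values that are not given in this document, and the total column volume in the usage line is likewise a hypothetical number.

```python
import math

# (name, Kav from the Figure 3 legend, Stokes radius in angstroms).
# The radii are commonly quoted calibration values, NOT taken from this document.
STANDARDS = [
    ("thyroglobulin", 0.039, 85.0),
    ("ferritin", 0.17, 61.0),
    ("catalase", 0.26, 52.2),
    ("aldolase", 0.28, 48.1),
    ("BSA", 0.39, 35.5),
    ("chymotrypsinogen A", 0.59, 20.9),
]

def partition_coefficient(v_e, v_0, v_t):
    """Kav = (Ve - V0) / (Vt - V0) for elution, void and total volumes."""
    return (v_e - v_0) / (v_t - v_0)

def _transform(kav):
    return math.sqrt(-math.log10(kav))

def fit_calibration(standards):
    """Least-squares line: Stokes radius = slope * sqrt(-log10 Kav) + intercept."""
    xs = [_transform(kav) for _, kav, _ in standards]
    ys = [radius for _, _, radius in standards]
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

def stokes_radius(kav, slope, intercept):
    """Interpolate a Stokes radius (angstroms) from a partition coefficient."""
    return slope * _transform(kav) + intercept

slope, intercept = fit_calibration(STANDARDS)
# Hypothetical volumes: Ve = 12.3 ml, V0 = 8.3 ml (Figure 6D legend), Vt = 24.3 ml (assumed).
example_kav = partition_coefficient(12.3, 8.3, 24.3)
print(round(example_kav, 3), round(stokes_radius(example_kav, slope, intercept), 1))
```

A smaller Kav maps to a larger radius, so a complex eluting near the void volume falls outside the range in which this calibration is meaningful.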

Figure 4. Immunoprecipitation of the FLAG-tagged IN from diluted nuclear
extracts.

(A) Nuclear extracts prepared from metabolically labeled 293T-INsala and
293T-INsalaFLAG cells were diluted to 200 µg/ml of total protein and immunoprecipitated
with the anti-FLAG M2 antibody and protein G agarose beads for 16 hours at 4 °C. The
protein was eluted by boiling in SDS PAGE sample buffer and separated in a 4-20% SDS
PAGE gel. A radioautograph of the gel is presented.

(B) FLAG-tagged IN was immunoprecipitated from a diluted nuclear extract of
non-labeled 293T-INsalaFLAG cells overnight. The protein was eluted with FLAG peptide in
400CSK buffer and cross-linked with DTSSP. The reaction conditions are similar to
those in Figure 2.

(C) The protein immunoprecipitated and eluted as in (B) was subjected to gel filtration on
a Superdex 200 column. The fractions collected (1-26) were analyzed by western blotting
with anti-IN antibodies. The Rs value corresponding to the observed peak was determined
as in Figure 3B.

Figure 5. Co-immunoprecipitation of FLAG-tagged IN with Inip76 from
non-diluted nuclear extracts.

(A) Nuclear extracts from 293T-INsalaFLAG and 293T cells (700 µg/ml of total protein)
were incubated with anti-FLAG M2 antibody and protein G agarose beads for 4 hours.
The beads were washed as described in materials and methods and bound proteins were
eluted with FLAG peptide in 400CSK buffer. The eluted proteins were concentrated by
precipitation with TCA, re-dissolved in SDS sample buffer, separated in 4-20%
denaturing PAGE gels and visualized by silver staining. Lane 1: immunoprecipitate of a
nuclear extract from 60×10^6 293T-INsalaFLAG cells. Lanes 2 and 3:
immunoprecipitation was done in parallel with nuclear extracts from 293T and
293T-INsalaFLAG cells. The bands of INf, Inip76, the heavy and the light chains of the
anti-FLAG M2 IgG1 antibody as well as aprotinin (protease inhibitor present in 400CSK
buffer) are indicated. Two sets of molecular weight markers were used in both gels to
determine the apparent molecular weight of Inip76. Positions of the molecular weight
markers are shown.

(B) Nuclear extract from 20×10^6 293T-INsalaFLAG cells (≈700 µg/ml total protein) was
immunoprecipitated for either 4.5 hours (left lane) or 18 hours (right lane). The protein
was then eluted and analyzed as in (A).

Figure 6. Inip76 is identical to LEDGF/DFS70/p75, appears to be a component of
the 61 Å complex and stimulates IN activity in vitro.

(A) The Coomassie blue-stained PVDF membrane used for amino-terminal
microsequencing of Inip76. The INf-Inip76 complex was eluted from protein G
agarose-immobilized anti-FLAG M2 antibody with FLAG peptide, separated in a reducing
4-20% SDS PAGE gel and transferred onto the PVDF membrane (lane 1). The proteins
left on the beads after incubation with FLAG peptide were eluted with SDS sample buffer
(lane 2). The bands corresponding to Inip76, INf, the heavy (H) and light (L) chains of the
anti-FLAG M2 antibody, aprotinin and the MW markers are indicated.

(B) Deconvoluted and sequence-converted MS/MS spectrum of a doubly charged peptide
ion (m/z 982.57) obtained from the in-gel tryptic digest of Inip76 corresponding to the
LEDGF peptide N425-K442. The observed b- and y-dominant fragment ions are
indicated (Biemann, 1990).

(C) Co-immunoprecipitation of LEDGF and FLAG-tagged IN from a nuclear extract of
293T-INsalaFLAG cells. Immunoprecipitation was carried out with anti-FLAG (lanes 2 and
2'), anti-LEDGF (lanes 3 and 3') or no antibody (lanes 1 and 1'). After 4 hours of
incubation, protein G agarose beads with precipitated protein complexes were washed
with three changes of 400CSK buffer and re-suspended in reducing SDS PAGE sample
buffer, followed by western blotting to detect INf and Inip76/LEDGF (lanes 1'-3').
Lanes 1-3 contain the immunoprecipitation supernatants.

(D) Elution of the 61 Å IN complex from a gel filtration column is shifted after
pre-incubation with anti-LEDGF antibody. A nuclear extract of 293T-INsalaFLAG cells was
preincubated with 3 µg/ml anti-HA (control IgG1) or anti-LEDGF antibody and
separated by chromatography on a Superdex 200 column. INf was detected in the
fractions by western blotting. The void volume of the column was 8.3 ml, approximately
corresponding to fraction 2.

(E) Cross-linking of the Inip76-INf complex with DTSSP. The Inip76-INf complex was
immunoprecipitated from a nuclear extract prepared from 293T-INsalaFLAG cells with
anti-FLAG M2 antibody and protein G agarose for 4.5 hours. The protein was eluted with
FLAG peptide and incubated with 2 mM DTSSP in the presence (lanes 1 and 1') or
absence (lanes 2 and 2') of 0.2% SDS. The cross-linked samples were then separated in a
non-reducing 4-12% SDS PAGE gel and immunoblotted with polyclonal anti-IN (left
blot) or monoclonal anti-LEDGF (right blot) antibodies. The positions of INf and Inip76
as well as the cross-linking adducts p150cl, p300cl, pHMW1cl and pHMW2cl are indicated.
Anti-FLAG M2 IgG1 present in the sample is detected on the anti-LEDGF western blot.

(F) Recombinant Inip76 enhances HIV-1 IN strand transfer activity in vitro. Mini-HIV 
DNA was pre-incubated with HIV-1 IN for 7 minutes at room temperature. Next, 
0-0.8 µM His6-tagged Inip76 was added to the reactions, which were further incubated at 
37 °C for 90 minutes. The concentrations of IN and Inip76 used in the reactions are 
indicated. The reactions contained 125 ng mini-HIV DNA, 110 mM NaCl, 20 mM Hepes 
[pH 7.5], 5 mM DTT and 5 µM ZnCl2 in a final volume of 20 µl. The reactions were 
stopped by addition of 0.5% SDS and 25 mM EDTA, and the samples were digested with 
0.25 mg/ml proteinase K at 37 °C for 30 minutes to completely disrupt protein-DNA 
complexes. DNA was then precipitated with ethanol, re-dissolved in Tris-EDTA and 
analyzed by electrophoresis in a 0.8% agarose gel. The positions of the DNA molecular 
weight markers (23.1, 9.4, 6.6, 4.4 and 2.3 kb) are indicated. The gel was stained with 
SybrGold (Molecular Probes). 
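
As a worked illustration of the reaction composition stated in (F), the amount of each component in a single 20 µl reaction follows from n = C × V. The Python sketch below is illustrative arithmetic only; the helper name and the dictionary layout are hypothetical and are not taken from the described procedure.

```python
# Illustrative arithmetic only: amounts per 20 ul strand-transfer reaction,
# using the final concentrations stated in the legend to panel (F).
# All names here are hypothetical helpers, not part of the described method.

REACTION_VOLUME_L = 20e-6  # 20 ul final volume

# Final concentrations in mol/l as stated in the legend.
FINAL_CONC_M = {
    "NaCl": 110e-3,
    "Hepes": 20e-3,
    "DTT": 5e-3,
    "ZnCl2": 5e-6,
    "Inip76 (highest dose)": 0.8e-6,
}


def moles_per_reaction(conc_m: float, volume_l: float = REACTION_VOLUME_L) -> float:
    """Return the amount of substance (mol) from n = C * V."""
    return conc_m * volume_l


if __name__ == "__main__":
    for name, conc in FINAL_CONC_M.items():
        nmol = moles_per_reaction(conc) * 1e9  # convert mol to nmol
        print(f"{name}: {nmol:g} nmol per reaction")
```

For instance, 110 mM NaCl in 20 µl corresponds to 2.2 µmol, and the highest Inip76 dose of 0.8 µM corresponds to 16 pmol per reaction.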

Figure 7. Co-localization of FLAG-tagged IN and Inip76/LEDGF in 293T-INsalaFLAG 
cells. 

(A) Confocal laser scanning micrographs of a fixed and permeabilized cell fluorescently 
stained with a combination of monoclonal anti-LEDGF plus Alexa-488 conjugated 
anti-mouse antibodies to detect Inip76 (green, Inip76) and rabbit polyclonal anti-FLAG plus 
Alexa-555 conjugated anti-rabbit antibodies (red, IN) to localize FLAG-tagged IN. DNA 
was stained with To-Pro3 iodide (shown as white). The two-color merged image 
(IN+Inip76) was produced by overlaying the IN and Inip76 images. 

(B) Both INf and Inip76 are associated with condensed chromosomes during mitosis. 
Immunofluorescent staining was performed as described in (A). 

(C) DNA PKcs and INf display no significant co-localization. INf (red, IN) was detected 
as in (A); DNA PKcs (red, PKcs) was localized with monoclonal anti-DNA PKcs 
antibody plus Alexa-555 conjugated anti-mouse antibody. The two-color IN+PKcs image 
is an overlay of the IN and PKcs images. 

Table 1. 

Residues   Sequence (a)          MW (b)       Error (Da) (c)
4-14       DFKPGDLIFAK           1249.67      [illegible]
40-56      LPIFFFGTHETAFLGPK     1921.01      [illegible]
57-67      DIFPYSENKEK           1368.66      [illegible]
76-89      GFNEGLWEIDNNPK        1631.76
136-143    AVDITTPK              [illegible]  [illegible]
156-175    [illegible]ASVLNK     2046.05      +0.15
                                              -0.16
425-442    NMFLVGEGDSVITQVLNK    1963.00

(a) Tryptic digestion fragments of LEDGF/p75. 

(b) MW are the calculated monoisotopic masses of the sequences in the table. 

(c) Errors are the differences between the calculated monoisotopic masses and those observed in the sample. 
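
The calculated masses in Table 1 can be reproduced by summing standard monoisotopic residue masses and adding one water molecule. The Python sketch below is a minimal illustration under that assumption; the function names are hypothetical, and the sign convention used for the error (observed minus calculated) is an assumption, since the table footnote does not state it.

```python
# Minimal sketch: neutral monoisotopic mass of an unmodified peptide.
# Residue masses are the standard monoisotopic values (Da) for the
# 20 common amino acids; no modifications are considered.
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.01056  # monoisotopic mass of H2O (Da)


def monoisotopic_mass(sequence: str) -> float:
    """Sum the residue masses and add one water for the free termini."""
    return sum(RESIDUE_MASS[aa] for aa in sequence) + WATER


def mass_error(observed: float, sequence: str) -> float:
    """Observed minus calculated mass (sign convention is an assumption)."""
    return observed - monoisotopic_mass(sequence)


if __name__ == "__main__":
    for peptide in ("DFKPGDLIFAK", "LPIFFFGTHETAFLGPK", "GFNEGLWEIDNNPK"):
        print(peptide, round(monoisotopic_mass(peptide), 2))
```

With these residue masses, DFKPGDLIFAK evaluates to approximately 1249.67 Da and LPIFFFGTHETAFLGPK to approximately 1921.01 Da, in agreement with the legible entries of the MW column.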

Integrase tetramers 

Claims 

1. A molecule which comprises a region specifically binding to a protein of the 
hepatoma-derived growth factor family or nucleic acids encoding said protein of the 
hepatoma-derived growth factor family, and suppresses or prevents retroviral 
replication. 

2. The molecule of claim 1, characterised in that it prevents or suppresses the protein of 
the hepatoma-derived growth factor family from interacting with retroviral integrase. 

3. The molecule of claim 1 or 2, which comprises a region specifically binding to the 
protein Inip76 or nucleic acids encoding said Inip76, and suppresses or prevents viral 
replication. 

4. The molecule of claim 3, characterised in that it prevents or suppresses protein Inip76 
from interacting with retroviral integrase. 

5. The molecule of any of claims 1 to 4, characterised in that it suppresses or prevents 
retroviral replication. 

5. The molecule of any of claims 1 to 4, characterised in that it suppresses or prevents 
HIV replication. 

6. A molecule according to any of the claims 1 to 5, which is chosen from the group 
comprising an antibody or any fragment thereof; a small molecule specifically binding to 
Inip76; or nucleic acids encoding said Inip76; a ribozyme against nucleic acids encoding 
Inip76; and anti-sense nucleic acids hybridising with nucleic acids encoding Inip76. 

7. Use of a molecule according to any of claims 1 to 6 for the preparation of a 
medicament to treat or prevent retroviral infection. 

8. Use of a molecule according to any of claims 1 to 6 for the preparation of a 
medicament to treat or prevent HIV infection. 

9. A method to identify molecules according to claims 3 or 4, comprising exposing Inip76 
or nucleic acids encoding said Inip76 to at least one molecule whose ability to suppress 
Inip76 from interacting with a retroviral integrase protein is sought to be determined, 
determining binding or hybridising of said molecule(s) to Inip76 or to binding places on 
said integrase of Inip76 protein or to nucleic acids encoding said Inip76, and monitoring the 
prevention or suppressing of retroviral replication or integration by the usage of at least 
one of said molecules. 

11. A method to identify molecules according to claims 3 or 4, comprising exposing 
Inip76 or nucleic acids encoding said Inip76 to at least one molecule whose ability to 
suppress Inip76 from acting as a cellular cofactor of retroviral integration is sought to be 
determined, determining binding or hybridising of said molecule(s) to Inip76 or to 
binding places on said retroviral integrase of Inip76 or to nucleic acids encoding said 
Inip76, and monitoring the prevention or suppressing of retroviral integration by the usage 
of at least one of said molecules. 

12. A method to identify molecules according to claims 3 or 4, comprising exposing 
Inip76 or nucleic acids encoding said Inip76 to at least one molecule whose ability to 
suppress Inip76 from promoting strand transfer activity of HIV is sought to be 
determined, determining binding or hybridising of said molecule(s) to Inip76 or to 
binding places on said integrase protein of Inip76 or to nucleic acids encoding said 
Inip76, and monitoring the prevention or suppressing of HIV integration by the usage of at 
least one of said molecules. 



13. A method for the production of a pharmaceutical composition comprising the usage 
of the method according to claims 9-12 and furthermore mixing said molecule 
identified, or a derivative or homologue thereof, with a pharmaceutically acceptable 

14. A protein complex comprising a retroviral integrase and Inip76. 

15. A protein complex comprising a recombinant retroviral integrase and a recombinant 
Inip76. 

16. A method of stimulating retroviral integration using Inip76 or functional analogues 
thereof. 

17. The use of Inip76 as a cellular cofactor of retroviral integration. 

Abstract 

In a study of HIV-1 integrase (IN) complexes derived from nuclei of human cells stably 
expressing the viral protein from a synthetic gene, it was demonstrated that in the nuclear 
extracts IN exists as part of a large distinct complex with an apparent Stokes radius of 61 Å, 
which dissociates upon dilution, yielding a core molecule of 41 Å. The IN complexes 
were isolated from cells expressing FLAG-tagged IN. By the present invention it was 
demonstrated that the 41 Å core is a tetramer of IN, whereas the 61 Å molecules are 
composed of IN tetramers associated with a cellular protein with an apparent molecular 
weight of 76 kDa. This integrase interacting protein (Inip76) was found to be identical to 
LEDGF/DFS70/p75, a protein implicated in regulation of gene expression and cellular 
stress-response. HIV-1 IN and Inip76 co-localized in the nuclei of human cells stably 
expressing IN. Furthermore, it has been demonstrated that recombinant Inip76 strongly 
promoted strand-transfer activity of HIV-1 IN. Our findings reveal that the minimal IN 
molecule in human cells is a tetramer and clearly demonstrate that Inip76 plays a role in 
retroviral integration. 